Stream Vector Processing Unit: Stream Processing Using SIMD on a General Purpose Processor
Abstract
Hypothesis: Modern scalar processors use fetch bandwidth inefficiently when executing vectorizable code. By augmenting a general-purpose processor with a Stream Vector Processing Unit (SVPU), we can use fetch bandwidth much more efficiently and achieve speedups on vectorizable code. We also believe that a hierarchy of register files will let this architecture use memory bandwidth more efficiently by exploiting locality in data streams.
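The fetch-bandwidth argument can be illustrated with a toy count of instruction fetches. This is only a sketch under stated assumptions, not the SVPU design: the function name `fetched_instructions`, the loop body size of 5 instructions, and the lane count of 8 are all hypothetical values chosen for illustration, and the model ignores caches, loop unrolling, and remainder handling beyond rounding up.

```python
import math


def fetched_instructions(n_elements, instrs_per_iteration, lanes=1):
    """Count instruction fetches for a loop over n_elements.

    Toy model (an assumption, not taken from the source): every loop
    iteration fetches `instrs_per_iteration` instructions and processes
    `lanes` elements. lanes=1 models a scalar loop; lanes>1 models a
    vector loop in which one instruction operates on several elements.
    """
    iterations = math.ceil(n_elements / lanes)
    return iterations * instrs_per_iteration


# Hypothetical workload: 1024 elements, 5 instructions per loop iteration.
scalar = fetched_instructions(1024, 5)            # one element per iteration
vector = fetched_instructions(1024, 5, lanes=8)   # eight elements per iteration
print(scalar, vector, scalar / vector)
```

Under this model the number of fetched instructions falls roughly in proportion to the number of lanes, which is the kind of saving the hypothesis attributes to vector execution.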
Similar resources
Compiling Data Dependent Control Flow on SIMD GPUs
Current Graphics Processing Units (GPUs) (circa 2003/2004) have programmable vertex and fragment units. Often these units are implemented as SIMD processors employing parallel pipelines. Data-dependent conditional execution on SIMD architectures is inefficient when implemented using processor idling. I propose a multi-pass approach based on conditional streams which allows dynamic load balancing of ...
Hardware Acceleration for CGP: Graphics Processing Units
Graphics Processing Units (GPUs) are fast, highly parallel units. In addition to processing 3D graphics, modern GPUs can be programmed for more general-purpose computation. A GPU consists of a large number of ‘shader processors’, and conceptually operates as a single instruction multiple data (SIMD) or multiple instruction multiple data (MIMD) stream processor. A modern GPU can have several hund...
Bottlenecks in Multimedia Processing with SIMD Style Extensions and Architectural Enhancements
Multimedia SIMD extensions such as MMX and AltiVec speed up media processing; however, our characterization shows that the attributes of current general-purpose processors enhanced with SIMD extensions do not match very well with the access patterns and loop structures of media programs. We find that 75-85% of the dynamic instructions in the processor instruction stream are supporting instructio...
GPGPU: General Purpose Computing on Graphics Hardware
The graphics processor (GPU) on current personal computers has developed into a rich and stable platform for developing scientific applications that do not fall under the category of traditional computer graphics software. The application of the GPU as a SIMD processor for scientific computing has proven to be a very powerful tool, to the extent that various graphics card manufacturers have star...
Energy-Efficient Architecture for DP Local Sequence Alignment: Exploiting ILP and DLP
Typical approaches to solving Dynamic Programming algorithms explore data-level parallelism by relying on specialized vector instructions. However, the fully parallelizable scheme is often not compliant with the memory organization of general-purpose processors, leading to suboptimal exploitation of parallelism and worse performance. The proposed processor architecture overcomes this issue by ...